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Recurrence plots and recurrence quantification analysis have become popular in the last two 
decades. Recurrence based methods have on the one hand a deep foundation in the theory of 
dynamical systems and are on the other hand powerful tools for the investigation of a variety 
of problems. The increasing interest encompasses the growing risk of misuse and uncritical 
application of these methods. Therefore, we point out potential problems and pitfalls related to 
different aspects of the application of recurrence plots and recurrence quantification analysis. 
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1. Introduction 



Since its introduction in 1987 bv lEckmann et al\ 19871], and the development of different quantification ap- 
proaches, recurrence plots (RPs) have been widely used for the investigation of complex systems in a variety 
of dif f erent discipli n es, as physiology, ecol o gy, finance or earth sc i ences [ e.g., Mar wan. 2 0081: Schinkel et al. . 
20071: IZbihit et all l2004l : iFacchim et ^ . l2007l : iBelaire-Frand] . l2004l : IPecail . i2003 : Trauth et a,l\ . 12003 



Cermakl . 120091 ] . RPs may attract attention because of their ability to produce beautiful or fancy pictures 



as in the case of the colourful representations of fractal sets Mandelbrotl . 119821 ] . The recent remarkable in- 
crease of applications can be traced down in part to several free software packages available for calculating 
recurrence plots and the corresponding recurrence quantification analysis (RQA). Since these methods are 
also claimed to be very powerful even for short and non-stationary data, we should be careful not to con- 
sider them as a kind of a magic tool, which works on all kinds of data. Owing to the fact that these methods 
are indeed in some sense powerful and rather adaptable to various problems, it is really important that 
the user knows how these methods work and has understood the ideas behind the RP and the measures of 
complexity derived from it. Any uncritical application will lead to serious pitfalls and mis-interpretations. 
As the number of applicants increases, the risk of careless application of RPs and RQA grows. 

In this article we try to highlight some of the pitfalls which can occur during the application of RPs 
and RQA and present future directions of research for a deep theoretical understanding of the method. 



2. Recurrence plots and recurrence quantification 

Although similar methods already existed before, the RP, Rij = Q{e — \\xi — Xj 

dynamics of a dynamical system by using its phase space trajectory was introduced by lEckmann et al. 



for th e analysis of the 



19871 ] ■ This method can be used in order to visuaUse the recurrence of a state, i.e., all the times when this 
state will recur. In the 1990's, a heuristic appr oach of quantification RPs by its line structures h as led to 
the recurrence quantification analysis (RQA) [Webber Jr. &: Zbilutl . Il994l : iMarwan et al\ . l2002bf | . In this 
approach, the density of recurrence points as well as the histograms P{1) of the lengths I of the diagonal 
and vertical lines in the RP are quantified. The density of recurrence points (recurrence rate) coincides with 
the definition of the correlation dimension [Grassberger Sz Procaccial . Il983l |. Moreover, RPs contain much 
more information about the dynamics of the systems: d ynamical invar i ants l i ke Renyi entropy o r correlation 
dimensions can be derived from the structures i n RP s Faure Sz Korn . 199a : Thiel et a/.l . |2004| | . RPs can be 
used to stu dy synchronisatio n [Romano et all l2005l : ISenthilkumar et all |2006| | or to construct surrogat e 



time series Thiel et all |2008| | and long time ser ies from ensemble me asurements Komalapriva et all l201[ll | . 
For a comprehensive introduction we point to Marwan et all 120071 ] . 



3. Pitfalls 



3.1. 



Parameter choice for recurrence analysis 

RP and RQA depend on some parameters which should be properly chosen. For the actual recurrence 
analysis, a recurrence threshold is necessary. This measure is probably the most crucial one and is discussed 
in the next subsection. 

As already mentioned, the quantification of recurrence structures depends on lines in the RP; by 
defining a minimal length of such lines, it is possible to adjust the sensitivity of line based recurrence 
measures. In Subsect. 13.31 and 13.41 we will come back to this parameter. 

If we start our recurrence analysis from a time series, we have first t o reconstruct a phase space by using 



a proper embedding, e.g., time-delay embedding Packard et aLl . ll980l ]. This involves the proper setting of 



two additional parameters: the embedding dimension m and the tim e -dela,y r. Although the estimation of 
dynamical invariants does not depend on the embedding [Thiel et all\2004 \. the RQA measures depend on 
the embedding. Standard approaches for finding optimal embedding parameters, like false nearest neigh- 
bours for embedding dimen sion and auto-correlation or mutual information for time-delay, can be widely 
found in the literature [e.g.. lKantz &: Schreibed . 119971 ]. However, it is recommended to visually cross-check 
the embedding parameters by looking at the resulting RP. Non-optimal embedding parameters can cause 
many interruptions of diagonal lines, small blocks, or even diagonal lines perpendicular to the LOI (this 
corresponds to parallel trajectory segments running in opposite time direction; Fig. [1]). The experience 
has shown that the delay is sometimes overestimated by auto-correlation and mutual information. The 
embedding dimension has also to be cons i dered with care, as it artificially increases diagonal lines (will be 



discussed in Subsect. 13. Sp [Marwan et all 120071 ] . 



In general, it is recommended to study the sensitivity (or robustness) of the results of the recurrence 
analysis on the parameters (recurrence threshold, embedding parameters). 

Although not really a parameter, it is worth to briefly discuss the different recurrence definitions. 
The most frequently used definition is the to consider neighbours in the phase space which are smaller 
than a threshold value (the re currence threshold) . Distances can be calculated using different norms, like 
Maximum or Euclidean norm [Marwan et all 120071 ] . Maximum norm is sometimes preferred because of its 
better computational efficiency (only minor differences in the results when compared to Euclidean norm). 
Another definition of recurrence considers a fixed amount nearest neighbours. This recurrence criterion 
is used when the number of neighbours is important. Pitfalls related to these recurrence criteria are also 
discussed in Subsect. 13.71 More interesting are combinations of the above criteria with dynamical properties 
of the p hase s pace trajectory, e.g., perpendicular RPs (Subsect. 13. 3p . or recurrence based on order patterns 



ep 

m 



Grothl . 120051 ]. Order patterns are representations of the local rank order of a given number d of values of 



the time series (order pattern dimension). As the number of order patterns is equal to d\, the dimension 
should not be chosen too large, because many order patterns will appear rather seldom and the RP will 
be sparse. Even d = 4 is often already not appropriate, therefore, d = 3 is the best choice in most cases 
(depending on the problem of interest, d = 2 may also be appropriate). 
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Fig. 1. Recurrence plots of the Rossler oscillator with parameters a — b — 0.25 and c = 40 using different embedding: (A) 
m = 1, r = 1, (B) m = 3, T = 12, (C) m — 3, t — 6 (adaptive recurrence threshold to ensure (A) RR = 0.1, (B, C) RR = 0.05). 
Non-optimal embedding can cause line structures perpendicular to the main diagonal, wobbly or interrupted lines (A, B). 



3.2. Recurrence threshold selection 

The recurrence threshold e is a cruci al paramete r in t h e RP analysis. Althou g h several work s have 



contributed to this discussion [e.g., iThiel et all l2002l : iMatassini et al\ . 12002 : iMarwan et all 12007 



Schinkel et all 12003], a general and systematic study on the recurrence threshold selection remains an 
open task for future work. Nevertheless, recurrence threshold selection is a trade-off of to have a small 
threshold as possible but at the same time a sufficient number of recurrences and recurrence structures. 

However, the diversity of applicability of RP based methods causes a number of different criteria 
for the selection of the threshol d: studying dynamica l properties (dynami cal invariants, synchronisation) 

twin surrogates or trajectory 
; noise corrupted observation 



requires a very small threshold [Marwan et all 120071 : iDonner et aL . 1201011 



reconstruction methods may require larger thresholds [Hirata et al 



200 



data requires even larger thresholds [Thiel et all l2002l | : for studying dynamical transitions, the threshold 
selection can be even without much importance, because the relative change of the RQA measures does 
not depend too much on it in a certain range; for the detection of certain signal s a specific frac t ion o f the 
phase space diameter (or standard deviation of the time series) can be required Schinkel et all 120081 ] . 

Several "rules of thumb" for the choi ce of e have been advocat ed in the literature, e.g., a few per cent 
of the maximum phase space diameter Mindlin Sz Gilmord. 1199211 . a value t hat should not exceed 10% 
of th e mean or the maximum phase space diameter Koebbe Sz Maver-Kresd. 1992 : Zbilut &: Webber Jr.l . 
19921 ] ■ or that the recurrence rate RR = J2i j Rij/^"^ is approximately 1% Zbilut et all 120021 ] . A recently 
proposed criterion employing the relationship between recurrence rate and e defines a n optimal v a lue by 



dRR 

de 



Gao fc Jinl . [2009| . 



using the position of the maximum of the first derivative of the recurrence rate 

Such approach can produce ambiguous and highly unstable results, as slight variations in e (as possible by 
minor errors in finding this value or by nonstationary time series) cause high variation in the recurrence 
structure. Next, the position of the maximum of -^^ depends strongly on the chosen norm and embedding, 
and may lead to an overestimation of an optimal e. And, finally, there are systems which can have more 

20ldl\ . 



than one maximum Donner et al. 



Another criterion for the choice of e takes into account that a measurem ent of a process is a composition 
of the real signal and some observational noise with standard deviation a jThiel et all 120021 ] . In order to 
get similar results as for the noise-free situation, e has to be chosen such that it is five times larger than 
the standard deviation of the observational noise, i.e., e > 5a. Although this criterion holds for a wide 
class of processes, it is difficult to estimate the amount of observational noise in the signal. 

For (quasi-)periodic pr ocesses, it has b e en sii ggested to use the diagonal structures within the RP in 
order to find the optimal e (Matassini et ad . 120021 ]. In this approach, the density distribution of recurrence 
points along the diagonals parallel to the LOI is investigated on dependence of e in order to minimise the 
fragmentation and thickness of the diagonal lines with respect to the threshold. However, this choice of e 
may not preserve the important distribution of the diagonal lines in the RP if observational noise is present 
(the estimated threshold can be underestimated). 



The selection of an optimal recurrence threshold e is not straightforward and depends on the particular 
problem and question. 



3.3. Indicators of determinism 

The length of a diagonal line in the RP corresponds to the time the system evolves very similar as during 
another time, i.e., a segment of the phase space trajectory runs parallel and within an e-tube of another 
segment of the phase space trajectory. Deterministic systems are often characterised by repeated similar 
state evolution (corresponding to a local predictability), yielding in a large number of diagonal lines in 
the RP. In contrast, systems with independent subsequent values, like white noise, have RPs with mostly 
single points. Therefore, the fraction of recurrence points forming such diagonal lines (of length / > ^min) 

DET = ±±piHH£ 11 (1) 

can be calculated and is, therefore, called determinism in the RQA. Somehow this measure can be inter- 
preted as an indication of determinism in the data. But we should be careful in using the term determinism 
in a more general or mathematical sense. In a deterministic system we can calculate the same exact state 
by using given initial conditions, i.e., there is no stochastic process involved. Different r uethods can be used 
to test for determinism in time series, e.g., a combined modellin g-surrogate approach Small &: Tsd . 120031 ] 
or an analysis of the directionality of the phase space trajectory Kaplan Sz Glasa . ll992l ]. 

High values of DET might be an indication of determinism in the studied system, but it is just a 
necessary condition, not a sufficient one. Even for non-deterministic processes we can find longer diag- 
onal lines in the RP, resulting in increased DET values. For example, the following (non-deterministic) 
auto-regressive process Xi = 0.8xi_i -|- 0.3j;j_2 — 0.25a;j_3 -|- 0.9^ (where ^ is white Gaussian noise) has 
a DET value of 0.6 (emb e dding dimension m = 4, delay r = 4, and fixed recurrence rate of 0.1). As it 



was shown in [Thiel et all 120031 ] . stochastic processes can have RPs containing longer diagonal lines just 



by chance (although very rare). Moreover, due to embedding we introduce correlations in the RP and , 
there f ore, also uncor r elated data (e.g. from white noise process) have spurious diagonal lines jThiel et all 
20061 : iMarwan et al\ . |2007| (Fig. [2]). Moreover, data pre-processing like low-passfiltering (smoothing) is 
frequently used. Such pre-processing can also introduce spurious line structures in the RP. Therefore, from 
just a high value of the RQA measure DET we have to be careful in infering that the studied system 
would be deterministic. For suc h conclusion we need a t least one further criterion included in the RP: the 
directionality of th e trajectory [Kaplan Sz Glassl. 119921 ] . One possible solution is to use iso-directional RPs 



Horai et all 120021 ] or perpendicular RPs [Choi et all 119991 ] ; if then the measure reaches DET ~ 1 for a 



very small recurrence density (i.e. RR < 0.05), the underlying system will be a deterministic one (like a 
periodic or chaotic system). 



3.4. Indicators of periodic systems 

As explained in the previous section, deterministic systems cause a high value in the RQA measure 
DET. This measure ha s been successfully used to detect transitions in the dynamics of complex sys- 
tems [Trulla et all 119961 ] . A frequently used example in order to present this ability is the study of the 



different dynamical regimes of the logistic map, where DET is able to detect the periodic windows (by 
values DET = 1). Therefore, it is often claimed that this measure is able to detect chaos-period transitions. 
However, we ca n also fi r id suc h high DET values for non-periodic, but chaotic systems. For example, 
the Rossler system [Rossleri . 119761 ] , 

' dx dv dz\ , , sx 

It' it' Tt) = (-^-^' ^ + 0-2%. 0.25 + z(x-c)), (2) 



exhibits in the parameter interval c € [35, 45] a transition from periodic to chaotic states (Fig. [3]A). But 
due to the smooth phase space trajectory and high sampling frequency (sampling time At = 0.1), the RP 
for the chaotic trajectory consists almost exclusively on diagonal line structures (Fig. |4]), resulting in a 
high value of DET, i.e., DET k. 1 (Fig. [3B). 
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Fig. 2. (A) Recurrence plot of one realisation of Gaussian white noise, calculated using embeddeding dimension m = 6, delay 
T — 1, and a recurrence threshold of e = 0.2. The embedding causes a number of long lines. (B) Correlation between a single 
recurrence point at (15, 30) and other recurrence points in the RP of white noise demonstrating the effect of embedding for a 
bogusly creation of long diagonal lines (estimated from 1,000 realisations). (C) The histogram of line lengths found in the RP 
shown in (A). The maximum length is Lmax = 17, a value, which would not be uncommon for a deterministic process. 
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Fig. 3. (A) 1st and 2nd positive Lyapunov exponents of the Rossler oscillator with parameters a = b = 0.25 and c £ [35, 45]. 
A periodic window occurs between c — 36.56 and c = 37.25. However, the DET measures reveals an almost constant very 
high value of approximately DET = 0.94. Used RP parameters: dimension m = 3, delay r = Q, adaptive recurrence threshold 
to ensure a RR = 0.05. 



A very high value of DET is not a clear or even sufficient indication of a periodic system. High 
values can be caused by very smooth phase space trajectories. This should also be considered when looking 
for indications of unstable periodic orbits (UPOs), where DET or mean and maximal line lengths L 
and Ljnax may not be sufficient. A solution could be to increase the minimal length /min of a diagonal 
recurrence structure which is considered to be a line. However, a better solution is to look at the cumulative 
distribution of the diagonal line lengths and estimate the K2 entropy (but this requires much longer time 
series, cf. Subsect. I3.9p . Recent work has shown that measures coming from complex network theory, like 
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Fig. 4. Recurrence plot of the Rossler oscillator with parameters a — b = 0.25 and c = 40. For this parameters, the Rossler 
system is in a chaotic regime (Ai = 0.14), but the RP consists almost only on diagonal lines. Used RP parameters: dimension 
m = 3, delay r = 6, adaptive recurrence threshold to ensure a RR = 0.05. 

clustering coefficie nt, applied to recurrence matrices are more powerful and reli able for the detection of 
periodic dynamics Marwan et ali . l2009l : IZou et all Isubml iDonner et all IXXXX | . 



3.5. Indicators of chaos 

The RP visualises the recurrence structure of the considered system (based on the phase space trajectory). 
The basic idea behind RPs comes, in general, from the study of chaos. Therefore it can be considered as 
a nonlinear tool for data analysis. But this cannot be a criterion to understand complex structures in the 
RP or high values of RQA measures as indicators of chaos or nonlinearity in the dynamical system. 

As mentioned above, uncorrelated stochastic systems have mostly short or almost no diagonal line 
structures in their RPs, whereas deterministic and regular systems, like periodic processes, have mostly 
long and continuous diagonal line structures. Chaotic processes have also diagonal, but shorter lines, and 
can have single recurrence points. Nevertheless, only by looking at the appearance of an RP it is difficult 
(almost impossible) to infer about the type of dynamics; only periodic and white noise processes can be 
identified with some certainty. 

The alternative is to look at the RQA measures quantifying the structures in an RP which are related 
to some dynamical characteristics of the system. As diagonal lines in the RP correspond to parallel running 
trajectory segments, it is clear that the length of these lines is somehow related to the divergence behaviour 
of the dynamical system. Divergence rate of phase space trajectories is measured by the Lyapunov exponent. 
In fact, the lengths of the diagonal lines are directly related to dynamical invariants as K2 entropy or D2 
correlation dimension Faure Sz Kornl . Il998l : iThiel et all l2004l ] . The K2 entropy is the lower limit of the 
sum of the positive Lyapunov exponents. 

For example, RQA measures based on the length of the diagonal lines, like determinism DET and mean 
line length L, also depend on the type of the dynamics of the systems (rather low values for uncorrelated 
stochastic (white noise) systems, higher values for more regular, correlated and also chaotic systems). 
It has been suggested to measure the length of the longest diagonal l ine -Lmay an d inter pret its inverse 



Trulla et al\ . Il996l |. However, this 



DIV = 1/Lmax as an estimator of the maximal Lyapunov exponent 

interpretation incorporates high potential of erroneous conclusions derived from RQA. 

First, the main diagonal in the RP (i.e., the line of identity, LOI) is naturally the longest diagonal 



line, wherefore it is usually excluded from the analysis. However, due to the tangential motion of the phase 
space trajector}!^, subsequent phase sp ace vectors are often also considered as recurrence points (known 
as sojourn points) [Marwan et aLl . l2007l |. These recurrence points lead to further continuous diagonal lines 
directly close to the LOI. Without excluding an appropriate corridor along the LOI (the Theiler window), 
Lmax will be artificially large (~ N) and DIV too small. 

Second, as explained above, even white noise can have long diagonal lines Thiel et aLJ . 120031 ] . leading 
to a small DIV value just by chance (Fig. [2|). Although the probability for the occurrence of such long 
lines is rather small, the probability that lines of length two occur in RPs of stochastic processes is, on the 
contrary, rather high. Only one line of length two is enough to get a finite value of DIV which might be 
mis-interpreted as a finite Lyapunov exponent and that the system would be chaotic instead stochastic. 

Therefore, we have to be careful in interpreting the RQA measures themselves as indicators of chaos. 
Moreover, such conclusion cannot be drawn by applying a simple surrogate test where the data points are 
simply shuffled (such a test would only destroy the correlation structure within the data, and, thus, the 
frequency information). 

RP or RQA alone cannot be used to in fer nonlinearity from a time series. For this pu rpose, advanced 
surrogate techniques are more appropriate Schreiber &: Schmita . 12000; iRapp et al\ . l200ll | . 



3.6. Discrimination analysis and detection of deterministic signals 

RQA is also a powerful tool in order to d i stinguish between different types of signals, differ e nt groups of 



dyna mical regimes etc. [e.g., IZbilut et al\ . Il998l : iMarwan et al\ . l2002bl : iFacchini et all 120071 : iLitak et al. 



20101 ]. However, the selection of applicable RQA measures is a crucial task. Not all measures will be useful 
for all questions. Their application needs justification in terms of the purpose of the intended analysis. For 
example, for processes which does not contain laminar regimes, or if we are not interested in the detection 
of such laminar regimes, it would not mak e sense to use RQA mea sures basing on vertical recurrence 
structures (like laminarity or trapping time) [Marwan &: Kurthd . 120091 ] . 



3.7. Indicators of nonstationarity and transition analysis 

RQA is powerful for the analysis of slight changes and transitions in the dynamics of a complex system. 
For this purpose we need a time-dependend RQA (a RQA series) what can be realised in two ways (Fig. [5]): 

(1) The RP is covered with small overlapping windows of size w spreading along the LOI and in which 
the RQA win be calculated, Rij\^^^i:^- 

(2) The time series (or phase space trajectory) is divided into overlapping segments Xi\^^^~ from 
which RPs and subsequent RQA will be calculated separately. 

Such time dependent approach can also be used to analyse the stationarity of the dynamical system. 

Here we should note the following important points. The time scale of the RQA values depends on the 
choice which point in the window should be considered as the corresponding time point. Selecting the first 
point k of the window as the time point of the RQA measures allows to directly transfer the time scale of 
the time series to the RQA series. However, the window reaches into the future of the current time point 
and, thus, the RQA measures represent a state which lies in the future. Variations in the RQA measures 
can be misinterpreted as early signs of later state transitions (like a prediction). A better choice is therefore 
to select the centre of the window as the current time point of the RQA. Then the RQA considers states in 
the past and in the future. If strict causality is required (crucial when attempting to detect subtle changes 
in the dynamics just prior the onset of dramatic state changes), it might be even useful to select the end 
point of the window as the current time point of the RQA (using embedding we have to add (m — l)r — 1). 
For most applications the centre point should be appropriate. 

Another important issue can rise from the different windowing methods (1) or (2), which are only 
equivalent when we do not normalise the time series (or its pieces) from which the RP is calculated and 
when we chose a fixed threshold recurrence criterion. If we normalise the time series just before the RP 



Tangential motion becomes even more crucial and influential for highly sampled or smooth systems. 
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Fig. 5. Two possibilities of windowed RQA: (A) Windowing of time series and (B) windowing of RP. The example is an 
auto-regressive process: Xi = 0.952;i_i + 0.052;i_2 + 0.9^ (where ^ is white Gaussian noise), the RP is calculated using a 
constant number of neighbours (10% of all points) and without embedding. The sub-RPs at the bottom clearly demonstrate 
the differences between the two approaches. 



Table 1. Selected RQA measures derived from windowing of time series (left) and windowing of RP (right) of 
an auto-regressive process and windowing as shown in Fig. (5] 

Window 1-250 251-500 501-750 751-1000 Window 1-250 251-500 501-750 751-1000 
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calculation, we get differently normalised segments resulting in different sub-RPs (and thus different RQA 
results) than such derived directly by moving windows from the RP of the entire time series (Fig. [5] and 
Tab. [T|). A similar problem arises when we use a fixed number of nearest neighbours for the definition of 
recurrence, because it is a big difference considering the entire time series in order to find the k nearest 
neighbours or just a small piece of it. Nevertheless, both approaches (1) and (2) can be useful and depend 
on the given question. If we know that the time series shows some nonstationarities or trends which are 
not of interest, then approach (2) can help to find transitions neglecting these nonstationarities. But, if we 
are interested in the detection of the overall changes (e.g., to test for nonstationarity) , we should keep the 
numerical conditions for the entire available time constant and chose approach (1). Anyway, for each RQA 
we should explicitly state how the windowing procedure has been performed. 

The choice of the window size itself needs the same attention. Because the RQA measures are statistical 
measures derived from histograms, the window should be large enough to cover a sufficient number of 
recurrence lines or orbits. A too small window can pretend strong fluctuations in the RQA measures just 
by weak statistical significance (the RQA measure TREND is very sensitive to the window size and can 
reveal even contrary results, cp. Fig. [7J3)- Therefore, conclusions about nonstationarity of the system should 
be drawn with much care. Moreover, statements on stationarity of the system itself are questionable at all 
(if not enough knowledge about the system is available), because detected nonstationarity in an observed 
finite time series does not mean automatically nonstationarity in the underlying system. For example, an 
auto-regressive process is stationary by definition, but its RP and RQA can reveal a nonstationary signal 
(Figs. El and EI). 
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Fig. 6. RP of the same auto-regressive process as presented in Fig. \5\ which is by definition stationary. The RP is calculated 
using maximum norm, e — 2 and without embedding. 



A 0.15 



DC 
DC 




B 



0.05 



400 600 

Time (a.u.) 



1000 




400 600 

Time (a.u.) 



1000 



Fig. 7. Two exemplary RQA measures, (A) recurrence rate RR and (B) paling trend (TREND), of the auto-regressive 
process as presented in Fig. [S]for three different window sizes 10(111 = 75, 150, 250). (A) The strong variation in RR pretends 
a nonstationarity in the signal. (B) TREND depends rather strongly on w, resulting in contrary outcomes, e.g., revealing 
high values for w = 250, but small values for ui = 75 at the same time period i = 700 . . . 800. The RQA is calculated using 
maximum norm, e = 0.3 and without embedding (the windows are moved by w/2, i.e., 50% overlap; the RQA time point is 
set to the centre of the RQA window). 



3.8. Significance of RQA measures 

Related to the preceding issue on windowed RQA is the question on the significance of the RQA variation. 
A sub-optimal scaling of the variation of the RQA measures can mislead to conclusions that the studied 
system has changed its regime or that it would be nonstationary (Fig. [5]A, B). Therefore, it is strongly 
recommended to cross-check the scaling of the presentation and to present confidence intervals (Fig. [8lC, 
D). Confidence intervals can be calculated in various ways, but we should avoid to derive them by simply 
shuffling the or i ginal data. One approach co uld be a bootstrap resampling of the line structures in the RP 
[Marwan et all 120081 : ISchinkel et all l2009l ]. Another approa c h fits th e probability of serial dependences 
(diagonal lines) to a binomial distribution Hirata &: Aiharal . IXXXXl ] . Whatever approach we chose, the 
estimation of the confidence intervals is not a trivial task, but in the future the standard software for RQA 
should include such tests. 
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Fig. 8. Two exemplary RQA measures, (A, C) determinism DET and (B, D) laminarity [LAM), of the auto- regressive 
process as presented in Fig. (5] (A, B) The scahng of the j/-axis is affecting a strong variation in the RQA measures - a 

i )otential of wrong co nclusions. (C, D) Considering a 5% confidence interval of the RQA measures (details can be found in 
Marwan et aZ.L 120081 ]) and a better value range for the y-axis, we cannot infer that the values of the RQA measures as shown 
in (A) and (B) significantly vary. The RQA is calculated using a window size of to = 250 and a window step of ws — 20, using 
maximum norm, e — 0.3 and without embedding (the RQA time point is set to the centre of the RQA window). LAM is the 
fraction of recurrence points forming vertical lines in an RP (analogously as DET for the diagonal lines). 

A common statement on recurrence analysis is that it is useful to analyse short data series. But we 
have to ask, how short is short? The required length for the estimation of dynamical invariants will be 
discussed in the following Subsect. Applying RQA analysis we should be aware that the RQA measures 
are statistical measures (like an average) and need some minimal length that a variation can be considered 
to be significant. 



3.9. Dynamical invariants from short time series 

An RP analy sis is appropriate for analysing short and nons t ationary tirne serie s, as it is often stated in 
many reports Fabretti Sz Auslood . l2005l : ISchinkel et aU 120071 : IZbilut et aUll998l |. However, this statement 
holds actually only for the heuristic measures of complexity as introduced for the RQA or for the detection 
of differences or transitions in data series. If we are interested in the dynamical invariants derived from 
RPs, the length N of the time series becomes a more crucial part like it is for the standard methods of 
nonlinear data analysis. 

The derivations of dimensions {Di, D2) and dynamical invariants (like K2) from the RPs hold only 
in the limit A^ — )■ 00 and small e (e — > 0). Nevertheless, an estimation of dynamical invariants from 
shorter time series can be feasible. We have to regard the following factors if discussing the time series 
length: the number of orbits representing stretching, the number of recurrences filling out a sufficient part 
of the attractor, and the number of data points necessary for an acceptable phase space reconstruction 
Wolf et all Il985l |. Since these factors may require different minimal lengths, the largest of these lengths 
should be considered. 

For example, numerical considerations for the estimation of the at t racto r (correlation) dimension 
D2 using the Grassberger-Procaccia algorithm Grassberger &: Procaccial . Il983l | lead to the requirement 
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log A^ > -^ log(-) (where q = 7- is the fraction the recurrence neighbourhood of size e covers on the entire 
phase space of diameter S) [Eckmann fc Ruelld . Il998l |. Considering a ^ = 0.1 and a decimal logarithm, for 
finding a D2 = 10 we need at least A^ = 100,000 data points. Furthermore, a ^ = 0.1 is actually too large 
and we need much smaller e, which consequently provokes that again a larger N is required. 

For Lyapunov exponents (and analogously for K2), a rough estimate based on the mentio ned require- 



ment s suggests minimal time series lengths of 10^^ to SO''^^ (with attractor dimension D2) [Wolf et al. 
19851 ]. Accordingly, a system wi th D2 = 3 requires 1000- 30,000 data points (a more strict consideration 
even requires logA^ > D2\og{\) [Eckmann &: Ruelld . Il998l |). 



Therefore, too guarantee useful results we need long time series. If we calculate dimensions or K2 from 
short time series the results are probably worthless. 

3.10. Synchronisation and line of synchronisation 

Cross recurrence plots (CRPs) ca n be used for the investigation of the siinultaneous evolution of two 
different phase sp ace trajectories Marwan et all l2002al : iMarwan fc Kurthsl . I2OO2I : IZolotova et all l2009l : 
Ihrke et all |2009| |. The line of identity (LOI) in the RP becomes a line of synchronisation (LOS) in the 
CRP. Two more-or-less identical systems but \y ith differences on the time-scale will reveal a bowed LOS 
Marwan et all l2002al : iMarwan fc Kurthsl |2005[ | . An off-set of the LOS away from the main diagonal is an 



indication of a phase shift or a delay between the two considered systems. 

However, because this method tests if the two trajectories visit the same region in the phase space, it can 
be used only to study complete synchronisation (CS) or a kind of a generalised correlation (although with 
possible delays), or to get the relation between the transformations between their time-scales. Moreover, 
the data under consideration should be from the same (or a very comparable) process and, actually, should 
represent the same observable. Therefore, the reconstructed phase space should be the same. 

For the study of the LOS the distance matrix may be more appropriate because it contains more infor- 
mation, especially if the data series sh ow nonstationarities. T hen, the LOS can be found by using efficient 
algorithms like dynamic time warping Sakoe &: Chibal . ll978l |. Nevertheless, it is always very important to 
check if the found LOS makes sense; for inst ance, it is possible to find several LOS (cp. Application in 



magneto-stratigraphy in Marwan et all |2007[ | ) 



3.11. Macrostructures and sampling 

For the visual interpretation of an RP and also for a reliable RQA we should remember that our data 
are discretised time or data series. The sampling of the signal has an importance which should not be 
underestimated. If the sampling frequency is just one magnitude higher than the system's main frequencies, 
and their ratio is not a multiple of an integer (i.e., we have an intrinsic phase error), an interference triggered 
by the sampling of the continuous signal can produce large emp t y regi ons in the recurrence matrix, although 
they should be there Facchini et all l2005l : iFacchini &: Kanta . [20071]. Nonstationarities or modulations in 
frequency or phase cause non-trivial gaps or macrostructures in the recurrence matrix (Fig. [9|). We should 
be aware that such gaps can occur in particular when we use a low sampling frequency. The recurrence 
structure of interest can appear rather different; diagonal lines can vanish or can be reduced to just single 
points yielding in biased RQA measures (Fig. [T0|) . 

Nevertheless, tiny modulations in frequency or phase in oscillating signals can be detected by RPs, 
which are non-detectable by standard methods (spectral or wavelet analysis). This turns RPs to a powerful 
tool for the analysis of slight modulations in oscillatory signals like audio signals. 

Please note that macrostructures are also an apparent problem when displaying large RPs on a com- 
puter screen (and up to a certain amount on print outs). The resolution of modern computer screens is 
around 72 ppi (points per inch, 72 ppi corresponds to around 28 points per centimetre). The presenta- 
tion of RPs in a window of, e.g., 6 inch allows only the display of around 430 points. Larger RPs will be 
rendered using downsampling or interpolation, resulting in similar interference effects and artificial sec- 
ondary macrostructures as described above; such macrostructures will even change for different window 
sizes (Fig. Ilip . Therefore, we should take care in visual interpretation of patterns found in large RPs which 
are represented on computer screens. 
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Fig. 9. RP of a modulated harmonic oscillation sin(27rl000(7r + t) + 27rsin(27r44t)t). (A) Non-trivial macrostructures (gaps) 
in the RP due to the interference of the sampling frequency of 1 kHz and the frequency of the modulated harmonic signal. (B) 
Corresponding RP as shown in (A), but for a higher sampling frequency of 10 kHz. As expected, the entire RP now consists 
of the periodic line structures due to the oscillation. Used RP parameters: dimension m — 3, delay r = 1, recurrence threshold 
e — 0.05ct, Z/oo-norm. 
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Fig. 10. RP of the i-component of the Rossler oscillator, Eq. ((2)1, with parameters a — b — 0.2, c — 5.7. The sampling time 
is (A) At — 0.05 and (B) At = 1. The embedding was chosen in both settings to be equivalent: dimension in (A) and (B) is 
m = 3, the delay in (A) r = 20, but in (B) r — 1; recurrence threshold e = 1.5 (maximum norm). Due to the low sampling in 
(B), many diagonal lines vanish. 



4. Conclusions 

We have illustrated several problems regarding the application of recurrence plots (RPs) and recurrence 
quantification analysis (RQA) which need our attention in order to avoid wrong results. The uncritical 
application of these methods can yield to serious pitfalls. Therefore, it is important to understand the basic 
principles and ideas behind the measures of complexity forming the RQA and the diff'erent techniques to 
study the numerous phenomena of complex systems, like transitions, synchronisation, etc. Nevertheless, 
the recurrence plot based techniques are still a rather young field in nonlinear time series analysis, and 
many open questions remain. For example, systematic research is necessary to define reliable criteria for 



REFERENCES 13 



RIe £dit V^iew [nsert Tools Window Help 






II D a^ H # 1 Dj 1 St e^ O ® h+= 1 n S| 




dl 




^ 


04S 


'■' ,■■'■■'■■■ ...,'■■'...:.■■' 






0.04 


- .■:■[■. .:/■■■ ;;■■ ■.;■■■■■■ 


.::;: :■: 




Q03S 




' 




0.05 


-■■:'■. .:.■•■■■■'. 


.:■■ y- 




D.025 




. ■ .■■' .- 




02 


.:■ . . . ..:■'■■ ... 


.;■■■ ..- 




001S 


-.;■■ -■■.;-:■- 


:■ ■ .:■ , 




01 




y. 




005 




■ .'■■■ 







/ : ■■'/; , y, /.' , ,//■'' 


.^, / 




0,OQ5 0,01 0,015 0,02 0.025 0.03 0,035 0.04 0.045 0.05 


■ 




^^ 



Fig. 11. Screenshot of the RP as shown in Fig. [S)3 for two different window sizes of display on a computer screen (using 
Matlab®). Although tlie RP consists only on continuous diagonal lines as represented in Fig. [9)3, its size (A'' = 5511) exceeds 
the screen resolution and requires downsampling, leading to artifical macrostructures. 

the selection of the recurrence threshold, and the estimation of the confidence of the RQA measures will 
be a hot topic in the near future. 
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